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Abstract 

In the Internet era, online social media emerged as the main tool for sharing opinions and information 
among individuals. In this work we study an adaptive model of a social network where directed links 
connect users with similar tastes, and over which information propagates through social recommendation. 
Agent-based simulations of two different artificial settings for modeling user tastes are compared with 
patterns seen in real data, suggesting that users differing in their scope of interests is a more realistic 
assumption than users differing only in their particular interests. We further introduce an extensive set 
of similarity metrics based on users' past assessments, and evaluate their use in the given social recom- 
mendation model with both artificial simulations and real data. Superior recommendation performance 
is observed for similarity metrics that give preference to users with small scope — who thus act as selective 
filters in social recommendation. 

Keywords: Taste similarity, agent-based modeling, adaptive complex networks, information diffusion 
and filtering, social recommendation. 

1 Introduction 

Online social networks (Facebook, Twitter, Digg, and others) are nowadays popular media which allow opin- 
ions and information to spread rapidly to friends, friends of friends, and further, eventually reaching a huge 
audience by cascading chains of influences [TJ [2]- Furthermore, they are able to reflect moods and current 
beliefs of people, giving early signs of what is just going to happen [3J 2]. The study of the structure and 
evolution of social networks has hence attracted much more attention than in the past, becoming a main 
focus of complexity science. To this end, agent-based models (ABM) represent an ideal tool for simulating 
the social behavior of autonomous individuals and the interactions among them [5] , and have been success- 
fully used to model decentralized scenarios where individual interactions lead to the emergence of collective 
patterns [6l [7] . When modeling social networks however one should be aware that they differ from other kind 
of complex systems because of two features: taste affinity and social influence between users. Taste affinity 
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(also named homophily or selection in the literature) is the tendency of individuals to form relationships 
with others who posses similar characteristics [8] . Social influence is the process which leads people to adopt 
behaviors exhibited by those they interact with [5]. Taste affinity and social influence not surprisingly lead 
to correlation between actions of adjacent nodes in the network, and hence are nowadays widely recognized 
to play a fundamental role in the formation and evolution of social systems |1Q[ 111] . 

In this work we focus on taste affinity between users. While past studies of homophily in online social 
network have been based on the analysis of emotionally annotated data [121 EH [H] , here we build our study 
on the adaptive model of social recommendation introduced in |15l 116) . The model consists of a network 
of users, each connected to other users who represent her information sources — which we name the user's 
leaders. Users in the community interact with some kind of content — we use news here, but they can be 
also movies, books, bookmarks, and so on. The propagation of a news over the network starts from the 
user who uploads it to the system and then automatically forwards it to her followers — the users who have 
chosen her as leader. If one of these followers later likes the news, it will be similarly passed further to this 
user's followers, and so on. This social spreading/filtering mechanism is known as social recommendation, 
and it is based on the evidence that users prefer recommendations made by friends rather than those by a 
centralized system [T7l [18] . Simultaneously with the propagation of news, connections of the leader- follower 
network can change when, for any user, a leader better than her current ones is found. The quality of the 
candidate leader is estimated by comparing the users' past assessments to obtain their taste similarity. Due 
to this mechanism, the evolution of the social network and the dynamical processes taking place on the 
network become coupled, as it happens in real social communities [191 120] . This gives rise to a rich variety 
of features, for instance the emergence of a "good get richer" mechanism which leads to scale-free leadership 
structures — again observed in real social networks [211 E2] ■ 

To study the evolution and dynamics of the system, we first turn to computer simulations and introduce a 
general agent-based framework for describing users' tastes. We build on the assumption that users' tastes 
and news' attributes can be represented by vectors, and that the assessment of a user on a news depends 
on the overlap between the respective vectors. A fundamental issue is hence how to construct these vectors. 
Usually one takes inspiration from real data, where dimensionality reduction techniques (Singular Value 
Decomposition, Probabilistic Latent Semantic Analysis and Latent Dirichlet Allocation) can be employed to 
extract analogous features from the given data [33]. However the result of this approach strongly depends 
both on the number of used hidden variables and, more importantly, on the particular dataset analyzed. 
Here we take the opposite approach: instead of seeking for a perfect model of users' interests, we study 
the system under two different plausible models of users' tastes. We then apply our social recommendation 
model to real data and determine which taste model better describes reality. 

The way of modeling users' tastes determines users' actions and, at last, the behavior of the whole sys- 
tem. However taste vectors are hidden variables both in our model and in reality. We hence tackle the 
opposite issue as well: given the users' actions, how to quantify users' taste similarity? In the literature, 
there arc a number of definitions for taste similarity based on users' assessment: the probability of agreement, 
the cosine similarity, the Jaccard coefficient, the Pearson correlation coefficient, to give a few examples, and 
each definition is apt for a particular setting. For instance, Jaccard's works with implicit ratings, whereas 
Pearson's exploits explicit assessments. Assuming that users may like, dislike or not rate an item, we make 
a comparative study of a number of similarity metrics. We use both simulations and real data to determine 
which of them are better for building an efficient leader-follower network for social recommendation. We 



2 



show that using a well-chosen definition of the similarity can significantly enhance the quality of the produced 
recommendations, and ultimately improve users' satisfaction. 



To summarize, our contribution is twofold: 1) we study two different ABMs for social systems built upon 
plausible assumptions about users' tastes, determining which one is closer to reality, and 2) we identify a set 
of similarity metrics which are suitable to measure taste affinity, and which can be used to obtain quality 
recommendations both in simulations (under different ABM assumptions) and in real case studies. The rest 
of the work is organized as follows. In Section 2 we describe the model for social recommendation along 
with different ways of modeling users' tastes and measuring taste similarity. Section 3 contains results of 
agent-based simulations, whereas Section 4 presents results obtained on real datasets. Conclusions are given 
in Section 5. 



We now describe the social recommendation model originally introduced in |15) . 

Notation. We denote as U the total number of users, L the number of leaders assigned to each user and 
Si\j the estimated similarity of reading tastes of user i with respect to those of user j — obtained from i's and 
j's assessments of news (see Sec. 12. ip . Evaluation of news a by user i (e< ) can be either +1 (liked), —I 
(disliked) or (not rated yet). The model assumes no other information than ratings of news by users. 

Network structure. The set of U users is represented as a monopartite social network, in which each user 
i is connected by directed links to L other users who represent i's leaders (information sources). Conversely, 
users who have i as a leader are referred to as followers of i. Thus, a directed edge points from j to i (j — > i) 
if user j is a leader of user i or, equivalently, i is a follower of j. 

News spreading. The directed leader-follower network serves as the basis for news propagation. When 
user k introduces news a to the system, the news is automatically passed to all followers of k. For follower 
j, the social recommendation score of news a is set proportional to j's similarity with the news's submitter: 
Rj a oc Sj\j. (the higher the Syife, the stronger the recommendation). If a is later approved by user j, it is 
similarly passed further to all users i who are followers of j with recommendation score proportional to their 
similarity with j: Ri a oc Sju. If a is approved by one of users i, it is passed further to i's followers, and so 
forth. Note that if a user receives the same news from multiple leaders, the news's recommendation score 
becomes proportional to the sum of the similarity scores between this user and the leaders who forwarded 
it, reflecting that a news approved by several leaders is more likely to be approved by this user too. Finally, 
to allow fresh news to be accessed fast, recommendation scores are damped with time with a function 
(1 — T _1 ) t_t °, where t represents the current time, t a is the time news a was introduced in the system, and 
t is the time scale of the decay. Summing up, for a generic user i at time t the recommendation score for 
news a is: 



Here Li is the set of leaders of user % and the term 5 eja ,i is one when user j liked news a {&j a — I) and zero 
otherwise (5 is the Kronecker symbol). Recommendation scores are used to sort the unread news each user 
receives (high scores at the top). 

Leaders updating. Simultaneously with the propagation of news, connections of the leader-follower net- 
work are rewired when, for any user, a leader better than her current ones is found. In practice this means 
that if j is the leader of i with the lowest similarity value, j is replaced with a new user fc if Sjij, > Sju (link 
j — > i is cut and k — > i is created). As selection strategy for picking new candidate leaders, we employ a 



2 The Model 




(i) 



3 



so-called Hybrid rewiring [23], which is composed 90% of LL + F local rewiring (k is chosen from the leaders' 
leaders and the followers of user i such that Sji^ is maximized) and 10% of random rewiring [k is simply 
a user picked at random from all users). This rewiring mechanism mimics the natural evolution of social 
communities where users search for friends among friends of friends (local rewiring) but also casual encoun- 
ters occur and may lead to long-term relationships (random rewiring). By construction, the Hybrid rewiring 
naturally enhances both the link reciprocity [25] and the clustering coefficient [33] of the social network [23] . 
In addition, it was shown to represent an optimal compromise between complexity and efficiency |24) . Of 
course, there are more sophisticated models of network evolution, which are based on statistical methods to 
estimate the model's parameters from empirical data [571 HH]- However, since it is not our goal to model 
a particular network, we stick on simple and general evolution rules — which nevertheless were shown to 
produce network topologies with high values of link reciprocity and clustering coefficient, which thus closely 
resemble the topologies of real social communities [24] . 



2.1 User Similarity 

An essential ingredient of the model is the estimated similarity of users' reading tastes, which regulates the 
news' flow over the system by determining the leaders' selection from users (i.e., the link structure of the 
network) and recommendation scores of news. Since we can rely only on users' ratings, the similarity of 
a pair of users has to be estimated from their assessments on news — which in our case can be either like, 
dislike, or not rated. 

The first similarity measure that we consider, first introduced in [15] . is the overall probability of agree- 
ment between users i and j. If ij = {ai|ej a = 1} and = {a|e,- a = —1} are respectively the set of news liked 
and disliked by user i, then: 



(o) = IhfW + \ d iC\ d j\ m 



where the denominator is the total number of news read by both i and j . We then consider the Jaccard co- 
efficient, which however only works with binary ratings. By considering only like instances (i.e., disregarding 
dislikes) we get: 

t(o) ._ \h[W ( o\ 

iW ""liiUiil [ ) 

which represent the ratio of the number of news liked by both i and j over the number of news liked by 
either of them. The opposite procedure (disregarding likes) gives: 

5(0) \dif\dj\ U) 
*U • \di\Jdj\ ( > 

We have tested this similarity and obtained poor results (not reported in this paper), which is not surprising. 
The social recommendation mechanism is in fact based on users' filtering: leaders forward to their followers 
only the news they like. This means that a news liked by leader j is always passed to follower i, who will 
possibly evaluate it. However if j does not like a news, i does not receive it directl}0 and it is less likely 



1 In principle, i could still receive the news from another leader; however, such leader has to receive the news first, and then 
like and forward it. This longer path is hindered both by the time-damping of recommendation scores and by the eventuality 
that leaders of the same user have similar tastes among themselves. 
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that i will ever evaluate it. For this reason, the sets \li lj\ and \di P| lj\ are much larger than \di P| dj\ and 
| li f] dj | , and a similarity score based on either of the first sets is much more reliable in measuring taste 
affinity than a score based on either of the second sets. Hence in what follows we will not consider measures 
based only on \di f] dj | or \lif]dj\. 

Coming back to we do not expect it to work better than because it exploits less information. 

We instead use as a baseline for introducing other similarity metrics: 



J 



J 
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Note that s^°\ and are symmetric measures: s5P? = 



ji, 0) = J<°> and J$ = J5,V. Instead j( 2 ) 



and arc not symmetric — what happens is that J^) = J^) and vice versa. Since our network is directed 
(allowing for situations where j is a good leader for i but not the contrary) , considering asymmetric similarity 
scores is more than appropriate. We pursue this direction further by introducing asymmetric equivalents of 
S (°) and of J(°>: 



S i\j 
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(OL) , (OF) 



is the probability of follower i and leader j agreeing on the news assessed by leader j 



Here s., . 

(follower i Jt wiroicao u tU v _ 4|j 
(follower i). Asymmetry here translates into s 



whereas {J^\ F ^) is the probability of follower i (leader j) liking a news liked by leader j 



(OL) 



s^ F) and J ( ° L) 



(OF) 



We finally consider asymmetric measures which, as for J*- 2 ' and J^ z \ combine the sets k, lj, di, 

dj as separate algebraic contributions. There are many possibilities to construct such similarities but as we 
said before, news liked by leader j arc the ones which are recommended to follower i: it makes more sense 
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to use the probability of i liking & news liked by j. 
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The meaning of each definition is straightforward: 


k ( ; l) is 


equal to Jju^ plus the probability of follower i 



disliking a news disliked by leader j; K^. is equal to J\. minus the probability of leader j liking a news 
disliked by follower i; and so forth. We remark that this list of similarities is not intended to be exhaustive. 
H Instead, we considered various definitions containing all the combinations of the sets U, lj, di, dj which 
are meaningful in our opinion and have the potential to achieve a reasonably good performance. 

Table [1] gives a summary of the similarity metrics that we introduced above. Note that all definitions 
of the similarity are prone to statistical fluctuations when the set of news considered for the assessment is 
small, i.e. when the denominator of an algebraic term is small. Since in a sampling of n trials the typical 
relative fluctuation is of the order of 1/n, we multiply each separate term of all introduced similarities by 
a factor (1 — rt -1 / 2 ), where n is the denominator of the considered term. Such factor disadvantages user 
pairs with a small n. Finally, when we do not have enough evaluation for a user pair (meaning that their 
similarity is undefined) we set it to a small positive value so, reflecting the fact that even when there are no 
users' evaluations, there is some base similarity of their interests. 

2.2 Agent-Based Validation of the Model 

For simulating the model, we use the following agent-based framework. 

Users' activity and News' diffusion. In each time step of the simulation, a given user is active with 
probability pa- When active, a user reads the top R news from her recommendation list, immediately 
forwarding the ones she likes to her followers; in addition, with probability ps she submits a new piece of 
news. Users have a limited receiving capability: if the number of recommended news becomes greater than 
the maximum users' stack length S, the news with lowest score is removed from the recommendation list. 

2 There are many other ways to assess users' similarities from the user-object bipartite graph and the user-user social graph: 
the generalized model of relational similarity 1291 . blockmodeling 1301 . the CONCOR algorithm |31l . automorphic equivalence 
1321 . Correspondence Analysis |33| . and many others. However, these methods require as input the whole dataset of users and 
objects, also to evaluate the similarity of a single user pair, and are much demanding in terms of computational cost. For 
this reason, they are not applicable in practice to online social networks with millions of users and continuously generated 
content — and thus have not been considered in our analysis. 



6 



symbol 



Tclbic 1; List of similarity metrics. 

formula 



7(0) 



no • 'i.n'ij 



no 
no 



+ \d t f]d 3 \ + \d l f]l 3 \ + \hf]d J 



uo 



nO I , \dif)dj 



uo 
no 



di U d 3 1 

d,noi 



uo 
no 



rf, u 

n <o 



uo 



r\ij\ + \dir\dj 



101 

noi 



\dj\ 

\diC\dj\ 



no 



1^1 

ono 



/.HOI , MifK 



01 

noi 
01 

noi 

rl'oi 
01 

noi 
rloi 



+ 



\d A \ 

i^rni 

1*1 
i^noi 

i^noi 
1*1 

kOI 

lon^i 



7 



Users' network and Links' rewiring. At any moment, the leaders of each user j should be those L users 
who have the highest taste similarity with j. Lacking any prior information, we assume random initial 
assignment of leaders. As the system gathers more evaluations from users, the quality of leaders can be 
assessed through the similarity metric, and links can be rewired (with the described hybrid strategy) in 
order to connect users to their best leaders. Rewiring of the links takes place after every u time steps. 

Users' tastes and News' attributes. To model users' judgments of read news we use a vector model where 
tastes of user i are represented by a D-dimensional taste vector £j = (tj, . . . , tf) and attributes of news a 
are represented by a D-dimensional attribute vector a a = (a* , . . . , a®). Similar vector models are often used 
in semantic approaches to recommendation |34) . Opinion of user i about news a is based on the overlap of 
the user's tastes and the news's attributes, which can be expressed by the scalar product 

fl ia =ti-a a (19) 

We assume that user i approves news a (ej a = +1) only when £li a > A; the news is disapproved otherwise 
{e-ia = — !)■ Here A is the users' approval threshold: the higher it is, the more demanding the users are. To 
set the taste/attribute vectors, we use two different approaches. 

• Homogeneous setting. Each user has preference for Da out of D available tastes — that means, each 
taste vector has Da elements equal to one (active tastes) and the remaining D — Da elements equal to 
zero. There are ) such vectors and hence we set the number of users in our system to U = ) , all 
having mutually different taste vectors. Note that the taste vectors of two users differ at least in two 
elements, we thus define as taste-mates users with exactly two different taste vector elements. We also 
assume that the attributes of a created news a are identical with the tastes of the user i who submits 
it (a Q = ti). 

• Heterogeneous setting. Each user has preference for a variable number of the D available tastes — each 
vector has a different number of elements equal to one (active tastes) and the remaining elements are 
zero. Here users differ not only by their particular tastes, but also by the scope of their interests — 
which, for a user i, is defined as hi := Y]?_-, t\ and represents the number of her active tastes, i.e., the 
area covered by her interests^ There are 2 D possible taste vectors and hence in principle we would 
set U = 2 D to have all mutually different taste vectors, with taste-mates now defined as users having 
only one different vector element. However in this scenario we are also considering users who like or 
dislike everything (i.e., the users with taste vectors of all ones or zeros, respectively). Hence in order 
to be more consistent and closer to life, we only consider the users whose taste vectors have a scope 
Da < h < D — Da- This reduces the effective number of users to U = 2 D — 2 J2d=o' 1 Cd)- We finally 
assume that each news' attribute vector has a fixed number Da of active attributes (number of ones), 
which are randomly chosen among the active tastes of the user who submits it. 

Parameters values used in all following simulations are listed in Table [3] For a discussion on how these 
parameters may affect simulation results, see [2~I] . 

3 hi = Dfy Vi in the homogeneous setting: each user has the same number of active tastes. To better illustrate the difference 
between taste (or interest) and scope of interests (or, simply, scope), consider user i who has preference for news about politics 
and economics, and user j who instead prefers news about economics, technology and science. Then the tastes of i are exactly 
politics and economics, and we can say that i and j have the same taste for economics (they are both interested in it) and, for 
instance, for sport (they are both not interested in it). The scope of user i is instead equal to two, as she has preference for two 
kind of news), and the scope of j is three. We can thus say that i has smaller scope than j, and that j has bigger scope than i. 
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Table 2: List of parameters used in simulations. 

symbol value 



Homogeneous setting: 

Number of users U 3003 

Dimension of taste vectors D 14 

Number of active tastes Da 6 

Users' approval threshold A 4 



Heterogeneous setting: 

Number of users U 3498 

Dimension of taste vectors D 12 

Minimum number of active tastes Da 4 

Users' approval threshold A 3 



Common parameters: 

Number of leaders per user L 10 

Probability of being active pa 0.05 

Probability of submitting a news ps 0.02 

Number of news read when active R 3 

Length of recommendation list S 50 

Period of the rewiring u 10 

Damping factor for time decay r 10 

Initial value of the similarity so 10~ 7 



2.3 Measures of the System's Properties 

We measure a number of quantities to describe the leader-follower network and the performance of the social 
recommender system: 

• Leadership Structure, measured by the cumulative distribution F(k) = P(f > k), where / is the 
number of a user' followers. 

• Number of dead ends (de) in the network, i.e. of users with no followers (/ = 0). We are interested 
in this quantity as, by definition, the news which are uploaded or liked by dead ends cannot spread to 
any other users, hence having many dead ends leads to a waste of user input and significantly reduces 
the diversity of news propagating over the network. 

• Link reciprocity, the tendency of node pairs to form connections between each other. It is defined as 
the ratio of the number of bi-dircctcd links to the total number of links in the network: 

2\2ij a ij a ji 

r := 

• Average differences. In a computer simulation we have the luxury of knowing users' taste vectors and 
hence we can compute in how many elements the taste vector of a user differs from the taste vector of 
her leader. By averaging over all users and all their leaders we obtain the average number of differences 
ad := jjx2~2i2~2ieLi P« — The fewer are the differences, the better is the assignment of leaders 
and the better the network has adapted to users' tastes. 
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• Recommendation efficiency. When making recommendations, it is possible to fall into two different 
kinds of error: recommending content that users does not like, and not recommending content that 
users does like. These errors are known respectively as false positives (FP) and false negatives (FN). To 
complete the picture, true positives (TP) are recommendations of content that users does like, and true 
negatives (TN) are lacks of recommendation of content that user does not like |35j . These quantities 
can be used to assess the quality of recommendations by measuring: 

TP 

1. Precision p := Tp_|_pp 

TP 

2. Recall 1 — p := r pp^pj\ | 

It is important to remark that in our model these quantities strongly depend on L, the number of 
leaders per user: big values of L imply more recommendations, which can have the effect of decreasing 
p and increasing 1 — f3, and vice versa. 



3 Results of agent-based simulations 

In this section we present the results of agent-based simulations of the social recommendation model. We first 
check if the system is able to improve the leader-follower network from its initial random configuration. Figure 
[1] shows the average differences at different times steps of the network's evolution for both homogeneous and 
heterogeneous settings. For readability reasons, we only report the evolution of the system under some of the 
definitions of the similarity measure (those, as will be discussed later, that are best performing). We observe 
that the average differences are gradually improved in the system, meaning that users are able to find better 
leaders — whatever the definition of the similarity. After an equilibration period, the system converges to a 
stationary network configuration (whose properties are reported in table [3J . The quality of the equilibrium 
state can be assessed as follows: 

• In the homogeneous setting, taste-mates differ in exactly two taste vector elements: the lowest possible 
value of the average differences is two, which corresponds to a globally optimal assignment of leaders in 
the network. Hence the quality of the final state can be estimated by how close the average differences 
are to two. Note that as for each user there are N = Da{D — Da) taste-mates, the optimal state is 
unique only if L = N. If L < N, there are different possible optimal states, which are determined 
by initial conditions and users' dynamics. If L > N (which however is not our case), the average 
differences are more than two even in the optimal state. 

• In the heterogeneous setting, taste-mates differ instead in only one taste vector element, and the lowest 
possible value of the average differences is hence one. In a situation where U = 2 D , there would be D 
taste-mates for each user, hence the optimal state would be unique if L = D, whereas if L < D there 
would be different optimal states, and finally if L > D the average differences would be greater than one 
even in the optimal state. However we are considering here only users with Da < h < D — Da, hence 
users with h = Da or h = D — Da only have D — Da taste-mates. Since we have set D — Da < L < D, 
we do not expect the average differences to ever reach one. In addition, we notice that here the number 
of taste-mates for each user is smaller compared to the homogeneous setting, hence getting close to the 
optimal state is more difficult. Finally, whereas in the homogeneous case the notion of taste-mates is 
simple, here we still have to investigate whether a good leader should have more or less active tastes 
than her followers — that would again reduce the number of available good leaders for each user. 
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Figure 1: Evolution of the average differences in the system for the homogeneous (a) and the heterogeneous 
(b) settings, and for some representative definitions of the similarity measure. 



Figure Q] shows that in both settings, with the system reaches the lowest average differences. With j( oi ) 
which only considers likes, one obtains the poorest performance — but the stationary state is not far from the 
optimal one. Using additional terms as in K^ 2L ^ and if( 2F ) brings the system very close to the final state of 
s' '. The stationary values of the average differences for these and other similarities are reported in table[3] 

Moving further, we investigate the leadership structure (figure [2]) and the properties of the network at 
equilibrium, as well as the performance of the recommender system (table EJ. As it was natural to expect, 
in the homogeneous setting the stationary degree distribution is rather homogeneous (with very few dead 
ends) and the reciprocity is high, whatever the definition of the similarity. The only exception is j(° F ); here 
users who happen to have few likes are not favored as leaders (/, |~| lj is not necessarily small compared to lj, 
but it is small compared to an average li), so they start to lose followers and the news they forward reach 
only a few users; this in turns causes li f] lj to remain small while U grows, so they again lose attractiveness, 
and so forth. As a results, these users end up with no followers. About the recommender system's perfor- 
mance, using s' ' (the best for average differences) yields the highest recall, whereas using K^ 2V) and K^ F ' 
the highest precision. Moreover, all similarities highlighted in the table are better than the others both in 
precision and recall. 

Things become more interesting in the heterogeneous setting. Here the stationary degree distribution has a 
longer tail (as it happens in many real systems) and the reciprocity is generally lower with respect to the 
homogeneous case, is still the best for average differences and recall, and the resulting network is again 
rather homogeneous (low de and high r), whereas K^ 2V) and K^ F ' have still the best precision (the gap 
with respect to increases here). However, for K^ 2F ^ the recall drops significantly, and this is caused by 
high de ratio. In this case, users with many likes are not favored {di p| lj is usually smaller than lj, but can 
be as big as an average di causing a —1 contribution to the similarity) and end up with no followers Q The 

4 This feature only appears in the heterogeneous setting which consists of users who by construction like more news than the 
others. In the homogeneous setting, a similar situation may arise only because of fluctuations which are however tamed by the 
first term \l{ (~] lj \/\lj |. 
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Figure 2: Cumulative distribution lunctions of the number of users' followers at equilibrium for the ho- 
mogeneous (a) and the heterogeneous (b) setting, and for some representative definitions of the similarity 
measure. 



Table 3: Properties of the network at equilibrium and performance of the recommender system (results are 
averaged over independent realizations). Values of de, p and 1 — f) are percentages. Note that the precision 
values in the two settings have as reference values the precisions of recommending at random, which are 
15.6% and 29.6% in the homogeneous and heterogeneous setting, respectively. 
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Table 4: Pearson correlation coefficients C(a*, s ) and C(Ah,s) between At or Ah and s for leader-follower pairs 
in equilibrium networks. The left column refers to the homogeneous setting (where Ahaj — Vi,j), the 
other two columns to the heterogeneous one. 
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same happens for K^ 1F ^ (for users with few dislikes, di |"| dj is not necessary small compared to dj, but it is 
small compared to an average di) and again for j( 0F \ 

Note that among the basic asymmetric indices, j( oi ) performs remarkably better than s( 0L ) and s^ 0F ^ — 
despite using only likes, and as expected it is better than J^). This justifies using j( oi ) for building the set 
of K similarities, with the result of K^ 2L ^ and K^ 2F ^ further increasing precision and recall in the system. 
Again, we can explain why these two metrics outstand the other Ks: K^ 2L ^ and K^ 2F ' are the only to feature 
in the second term the set lj, which is the most informative to consider as it contains all the news j passes 
to i — which determine i's satisfaction, and hence the quality of the recommendation process. 

We now investigate further the role of various users in the heterogeneous setting. In particular, we con- 
sider each leader- follower pair in the system and define their taste- vector difference Atij — \\ti — tj\\ and their 
scope difference Ah^j — hj — hi. We then study which pairs are favored by the used similarity measure by 
computing the Pearson correlation coefficients C(At, s ) (between At and s) and C(a/i, s ) (between Ah and s). 
Table S] shows these values, together with C(a*, s ) for the homogeneous setting as a benchmark. 
As expected, in the homogeneous setting all C(At, s ) are negative: s and At are anticorrelated, i.e., the smaller 
the difference, the higher the similarity, yields the highest c in absolute value, followed by K^ 2L ^ and 
K^ 2F \ We see that the ability to distinguish users' pairs with different At is directly related to final average 
differences and system's performance (table [3]) . 

In the heterogeneous setting, C(At. s ) values are still negative and very close to the ones of the previous 
setting — with the exception of s^ ' whose c is halved. With respect to C(Ah, s ), as expected, symmetric 
measures cannot distinguish Ahiy from Ahj\ i: hence we obtain no correlation in this case. j( 0F ' favors 
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Table 5: Pearson correlation coefficients of h with /, p and 1 — j3 in the heterogeneous setting. 





ChJ 




c M-/3 


s (0) 


0.18 


0.96 


-0.33 


7"(0) 


-U.ZO 


u.oy 


n 79 

-U. f Z 




-0.27 


0.89 


-0.54 


J(2) 


-0.62 


0.94 


-0.84 


7(3) 


O /I V 

U.4 / 


n no 


O Qfl 

-U.oU 


S (0L) 


0.04 


0.93 


0.15 


S (0F) 


0.06 


0.93 


-0.01 


J(0L) 


-0.74 


0.91 


-0.85 


J(0F) 


0.49 


0.96 


-0.48 




-0.56 


0.92 


-0.62 




-0.74 


0.92 


-0.75 




-0.74 


0.92 


-0.93 


K ( 2F ) 


-0.73 


0.91 


-0.89 


K& L ) 


-0.15 


0.93 


-0.88 


K (3F) 


0.12 


0.92 


-0.86 



instead users with higher h (more likes, as explained before). For other metrics, we generally observe a weak 
correlation, with some important exceptions: j( 2 \ j( 0L \ K^ 1F \ if( 2L ) and K^ 2F \ ordered by increasing 
anticorrelation. These metrics are able to distinguish user pairs with different scopes and favor users with 
smaller scope as leaders, which then leads to superior recommendations (table [3]). 

To show that this is the case, consider a user pair with Ai^ small and hi > hj => A/i.^ < : users i and j 
have very similar tastes, but j is slightly more demanding than i. Since j is z's leader, in the news spreading 
process j blocks all the news she dislikes so that i will like almost all the news that she gets from j. This 
means that j is a good leader for i as she forwards only the news matching i's tastes. If we do not consider 
missing evaluations, this situation can be roughly described as IZiH^'l — l^il; l^iD^'l — 0, 1^04/1 0, 
\di C\dj\ — \di\ < \dj\- The opposite case hi < hj is instead not favorable for i as in this case her leader j 
could also forward news that i would not like (as j is less demanding than i). Here |^fV:)l — \h\ < l^'li 
l^iD^'l 7^ l^n^j'l — 0, Min^jl — Mil < The situation is not symmetric and that is why the 

reciprocity coefficient in the heterogeneous setting drops. Given these two cases, j( 0L ) favors the first one 
as by definition, J^ L ^ ~ 1 if hi > hj but J^ L ^ = J., . < 1 if hi < h-j. The other K metrics are built from 

j( 0L ) by introducing additional terms. These terms for K^ 0L \ K^ F ^ and, in lesser degree@ K^ 1F ^ promote 
the first case (and hence users' satisfaction) even more, whereas the terms for K^ 1L \ if ( 3i ) and K^ 3F ^ go in 
the opposite direction: the corresponding overall C(a/i, s ) becomes close to zero and users' satisfaction lowers 
significantly. The system's behavior hence strongly depends on the particular similarity definition used. 

In addition we study how individual users' features depend on the scopes of interests h. Table [5] reports the 
Pearson correlation coefficients of h with / (number of users' followers), p and 1 — j3 (precision/recall of the 
recommendations for the user). Concerning the number of followers, users with small h (more demanding) 

5 We already discussed that the term \di {~}dj\ is not the most reliable for computing similarity. 
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Figure 3: Probability distributions for individual users' precision (upper panels) and recall (lower panels) 
values for the homogeneous [(a),(c)] and the heterogeneous [(b), (d)] settings. 
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become popular leaders when using j( 2 \ j( 0L \ K^ 1F \ if( 2i ) and K^ 2F \ as expected from the previous 
discussion. This feature is still present when using and K^ L \ as in these cases the effect of the 

damping terms (the ones in parenthesis) is to favor users with many dislikes, i.e. with small h. For s' ' 
instead the correlation is small but positive: users with high h are slightly favored as their bad filtering 
ability is not penalized enough by a symmetric measure, but on the contrary it is compensated by the big 
number and variety of news they forward (which increases the overlap of rating histories with any other 
user, and hence the probability of being chosen as a leader by that user). Finally, for j( 0F ) (and which 
features U in the denominator of the second term) users with small h are without followers — as a result of the 
previous discussion. Moving further, naturally precision is always better for users with high h, who are less 
demanding. Recall instead has the opposite trend: demanding users with small h are potentially interested 
in a few news, which they easily get from well-chosen leaders, whereas users with big h would be interested 
in many more news, but still have the same finite receiving capability. 

Finally we plot in figure [3] the probability distributions of precision and recall values for individual users. 
We observe that in the homogeneous setting these distribution have a well defined peak, whereas in the 
heterogeneous setting p and 1 — ft values vary more among users. This represents a natural (and expected) 
outcome of the particular way to model users' tastes. 

4 Testing on real data 

In this section, we test the similarity measures presented in Sec. 12. li on real data. To this end, we employ 
users' assessments from two online rating platforms to build two corresponding leader-follower networks for 
social recommendation (in the spirit of the model presented in this work) , and examine if the results obtained 
by artificial simulations can be recovered from real users' preferences. 
The dataset that we consider are from: 

• MovieLensH a movie recommendation website in which users can rate movies to get personalized 
recommendations. The dataset that we analyze is freely available at www.grouplens.org, and consists 
of over 1,000,000 anonymous ratings on approximately 3,900 movies made by about 6,000 MovieLens 
users since year 2000. The rating system is integer, with scale from 1 (worst) to 5 (best). 

• NetflixQ the world's largest online movie rental service where users can rate the movies they watch. 
The dataset that we analyze is freely provided for the Netflix Prize ( htt p: / /www.netflixprize.com[ ) , and 
consists of over 100,000,000 ratings from about 480,000 anonymous Netflix customers on 17,000 movie 
titles. Again, ratings are integers from 1 to 5. 

For each dataset, and for each similarity measure, we use the ratings given by users to build a leader- follower 
network: each user selects as leaders L = 10 other users which are the most similar to her. Since the sim- 
ilarity definitions in Sec. 12.11 are based on binary assessments (like/dislike), in order to restore this setting 
we regard ratings 4 and 5 as likes and ratings 1, 2, 3 as dislikes. In this way, we have almost the same 
number of likes and dislikes for each dataset. Note also that the process of selecting for each user the most 
similar leaders is rather demanding, requiring approximately U x M operations (here M is the total number 
of ratings). We hence restrict our analysis to subsets of 1,000 randomly selected users. For Netflix, we also 
consider only 4,000 randomly selected movies (and respective ratings). We then average our results over 

6 http:/ /www. movielens.org 
7 http: / /www. netflix. com 
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Table 6: Properties of the artificial leader- follower networks and quality of the recommendations (results 
are averaged over independent subset selections). Values of de, p and 1 — /3 are percentages. The precision 
values can be compared to the precisions of recommending at random (i.e. the percentages of likes among 
the ratings), which are 61.7% for MovieLens and 59.8% for Netflix. 
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various independent selections of the subsets. 

After the social network of users is constructed, movie recommendations for a given user are directly given 
as the movies liked by her leaders (we have this information from the real data itself). Note that in order 
to produce enough recommendations, we do not consider if in reality the user has rated a movie before her 
leaders, or if a movie recommended from a leader actually come from this leader's leaders, and we do not 
consider damping effects. In other words, we use the leader-follower network to have a single-step spreading 
system, in which every movie liked by a user in reality is recommended to her followers. We then obtain 
precision and recall for these recommendations. As before, precision is the ratio of good recommendations 
(i.e., recommendations for the movies that users liked in reality) to the total number of recommendations 
for the movies liked or disliked by users in reality, whereas recall is the ratio of good recommendations to 
the total number of movies liked in reality by users. 

Table [6] reports the properties of the constructed social networks and of the recommendations for the two 
datasets, and Figure |4] shows the leadership structure of the networks. We immediately notice that the 
degree distributions are broad and the reciprocity values are relatively low (compared to the values of Table 
[3]) — even with a symmetric measure s^>. As a consequence, there are many dead ends in the network. This 
kind of structure recalls the heterogeneous case more than the homogeneous one (cf . Table [3]) . Note that 
the differences between results from agent-based simulations and empirical findings (lower reciprocity and 
broader degree distribution in the latter case) can be attributed to the fact that the real scenario is even 
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Figure 4: Cumulative distribution function of the number of users' followers for MovieLens and Netflix. 



more heterogeneous than what the heterogeneous setting is supposed to model: real users arc diverse not 
only in their particular tastes and scopes of interest, but also in their behavior, features, social influence, 
and many other factors. For instance, by introducing users' heterogeneity in usage frequency and evaluation 
noise (as in |21j . but see also |22j). one observes a degree distribution which is very broad or even scale- free. 

As for recommendation metrics, we confirm that also for real data K^ 2L ^ and K^ 2F ^ are best in preci- 
sion, whereas is best for recall. The gap in precision between if( 2i ) and suggests that asymmetric 
and heterogeneous tastes better describe users' preferences. Finally, for K^- 1F ^ we notice a peculiar behavior 
that we do not observe in simulations: it produces the longest tails of P(k), few dead ends and very low reci- 
procity. The high value of recall obtained in this case can be attributed to this particular network structure. 

Finally, we plot in figure [5] the probability distributions of precision and recall values for individual users in 
the real datasets. Both for Movielens and Netflix, we observe that these distributions are broad, suggesting 
again that the heterogeneous vector setting represents a better way to model real users' tastes. 

5 Conclusion 

In this work, we have studied the role of taste similarity in social networks where social recommendation is 
at work. Building on the adaptive model of social recommendation introduced in |15j . we introduced two 
settings for modeling users' tastes, and used agent-based modeling to study the evolution and properties 
of the system under these different settings. After cross-checking our results with real data, we concluded 
that modeling users with different scopes of interests produces results closer to those obtained from real user 
actions. 

Besides modeling users' tastes, we tackled in this work also the complementary issue of quantifying taste 
similarity from users' actions. We investigated several definitions of users' similarity both in agent-based 
simulations and in real data, revealing which definitions are more reliable in measuring taste similarity, and 
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Figure 5: Probability distributions for individual users' precision (upper panels) and recall (lower panels) 
values for Movielens and Netflix. 
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more effective in constructing leader-follower networks for social recommendation. In particular, we showed 
that asymmetric similarities are able to reveal if a user is more or less demanding than another, and generally 
yield better results. In addition, we discussed which sets of actions are more appropriate to consider when 
building a taste similarity that can enhance both precision and recall of the social recommendation process. 

We remark that in our model we neglect some aspects of the dynamics of social recommendation and 
user behavior. For instance, real users are not infallible and thus they can commit mistakes in selecting their 
leaders. At the same time, each user can have her own selection criteria (i.e., her own similarity metric). 
Both these interesting aspects would require extensive consideration, and will be addressed in future works. 
Finally, our model assumes a passive relationship between leader and follower, with the latter interested 
to select the former by homophily. The issue of whether leaders might have an active role in changing 
the preferences of their followers by interacting with them is not discussed at all in this paper; in other 
words, we consider the time scale of eventual changes of preferences to be much larger than the lifespan of 
a leader-follower interaction, so that users' preferences remain unaltered. Extending our model to include 
more interactive relationships constitutes a fascinating research direction that we will pursue in the future. 
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